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Abstract. Several types of term rewriting systems can be distinguished 
by the way their rules overlap. In particular, we define the classes of 
prefix, suffix, bottom-up and top-down systems, which generalize similar 
classes on words. Our aim is to study the derivation relation of such sys- 
tems (i.e. the refiexive and transitive closure of their rewriting relation) 
and, if possible, to provide a finite mechanism characterizing it. Using a 
notion of rational relations based on finite graph grammars, we show that 
the derivation of any bottom-up, top-down or suffix systems is rational, 
while it can be non recursive for prefix systems. 



1 Introduction 

Word rewriting systems are among the most general formalisms found in com- 
puter science to model word transformations. They generalize grammars, can 
represent the runs of finite automata, transducers, pushdown automata or even 
Turing machines. They can thus be considered as a unifying framework to com- 
pare all these heterogeneous formalisms. For instance, [6j proposes a homoge- 
neous presentation of several well-known families of infinite graphs, using an 
approach based on word rewriting systems proposed in [1] , which is to consider 
the 'Cay ley graph' of a rewriting system. In another paper [5], a classification of 
word rewriting systems according to the way their rules overlap is established. 
It is proved that the derivation relations of four classes of systems are rational, 
which means that they can be generated by finite transducers. These systems 
called left, right, prefix and suffix, were later used in [Sj. Any other class is shown 
to contain at least one system whose derivation is not rational. 

The aim of this work is to extend these results from words to terms. To 
summarize, we will be interested in term rewriting systems whose derivation 
can be characterized by a finite mechanism. First of all, we have to specify 
which definition of rationality for relations on terms we intend to use, as several 
distinct notions already exist (see [13] for an overview). Unfortunately, none 
of them is as widely adopted as the standard one for words, as each relies on 
different characteristics of the word case, and serves a different purpose. In this 
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paper, we will adopt a notion introduced in [14 , which makes use of hyperedge 
replacement graph grammars. The reason for this choice is the close similarity 
between the way these grammars work, and the asynchronous mechanism of a 
word transducer. Then we extend the definitions of left, right, prefix and suffix 
systems to terms, yielding what we will call bottom-up, top-down, prefix and 
suffix systems, and investigate the rationality of their derivation relations. We 
also mention recognizability preservation properties for bottom-up. top-down 
and suffix systems. 

Numerous works deal with term rewriting systems. Among the closest to our 
approach, we can mention for instance [lOj and [15| . which specifically investi- 
gate the recognizability preservation properties of term rewriting systems. Both 
papers study classes of systems which properly include the class of top-down 
systems, and prove that they preserve recognizability. However, the derivation 
relations of these systems are not rational (more generally, no finite representa- 
tion of these relations is given) . On the contrary, Dauchct and Tison extensively 
studied ground term rewriting systems, i.e. systems whose rules do not contain 
variables [8]. In particular, they proved that these systems have a decidable first 
order theory with reachability [5] by explicitely building their derivation relation. 
From another point of view, |12] and [7] investigated the geometric properties 
of transition graphs of ground systems and compared this family of graphs with 
respect to other well-known families. Note that by definition, ground systems are 
a special kind of suffix systems. Finally, we can mention the theme of symbolic 
model-checking, whose main idea is to represent regular sets of configurations by 
finite word automata and system transitions by rewrite rules or transducers (see 
for example This field is currently being extended to systems with richer 
topologies, like trees |ll3j . A central problem relevant to this method is to com- 
pute the set of configurations reachable in any number of steps when starting 
from a regular set of configurations (for instance a recognizable term language). 

This paper is organized as follows: after recalling a few basic notions about 
trees, terms and recognizable languages, we present the notion of rationality for 
term relations introduced in [14j . In Section 3] we introduce term rewriting sys- 
tems, and detail the four subclasses we consider. The last two parts present our 
results concerning the rationality of the derivations of top-down, bottom-up and 
suffix systems, as well as remarks concerning their preservation of recognizability. 

2 Terms and Trees 

Let F = IJ„>Q Fn be a finite ranked alphabet, each F„ being a set of function 
symbols of arity n (elements of Fq are constants), and X be a finite set of variable 
symbols. The set T(F, X) of finite first-order terms on F with variables in X is 
the smallest set including X and satisfying f & Fn A ti, . . . ,tn & T{F,X) 
/ii . . . t„ G T{F, X). The set T(F, X)+ of tuples of terms will be called the set of 
term words. A term word t = (ti, . . . , t„) is usually noted ti . . . tn, and t{i) is used 
to denote ti. The dimension of t is called its length and noted |t| (here |t| = n). 
Term words containing no variable are called ground. The set of ground terms is 



noted T{F, 0) or simply T{F). The set of variables actually occurring in a term or 
term word t is Var(t), and t is said linear if each of its variables occurs only once. 
If moreover t has n variables, it is called a n- context. The variables of a 71-context 
are conventionally noted □!,...□„. The set of n-contexts is denoted by C„(F), 
the set of all contexts by C{F). A common operation on terms is substitution. A 
substitution is fully defined by a mapping from X to T{F,X), and extended to 
a morphism as follows: we note ta the application of a substitution a to a term 
word t, which is done by replacing every occurrence of each variable x occurring 
in t by the term (7{x). The set of substitutions over F and X is noted S{F,X). 
For any term word s ~ si . . . s„ and when t is a n-context, we use t[s] as a 
shorthand notation for the variable substitution t{Di 1^ Si \ i E [l,n]}. All these 
notations are extended to sets of term words in the usual way. A term, term 
word, context or substitution is said to be proper or non-trivial if it contains at 
least one symbol in F. 

Let N be the set of strictly positive integers, we call position any word in 
the set N*. Every term t in T{F, X) can be represented as a finite ordered tree 
whose nodes are labeled by symbols in F or variables in X, or equivalently as a 
mapping from a prefix-closed set of positions Pos{t), called the domain of the 
term, to the set F U X. Let t = /(ti, . . . , i„, . . .) be a term represented by an 
ordered tree, position e denotes the root of t, and for 71 G N, p G N*, position np 
denotes the node at position p in subtree t„. Seeing terms as trees, term words 
can be seen as ordered forests. In the following, we will use the prefix partial 
order on positions, noted ^: let p and q be two positions, p ^ q if there is some 
q' € N* such that p = qq' . If furthermore q' ^ e, we write p > q. We denote 
by pos{x,t) the set of positions at which the variable x d X occurs in term 
t e T{F,X). 

The most common acceptors for languages of trees (and thus terms) are 
finite tree automata. Among several variants, we will only consider top-down 
tree automata, defined by a finite set Q of control states and a finite set R 
of transition rules of the form qf — > fqi . . . qn where f G F^ {n can be 0) 
and q, qi, . . . qn S Q. A configuration is an embedding of control states in the 
input tree, i.e. a tree from T{F U Q,X), where each g G Q is considered a 
unary function symbol. A rule qf fqi . . . qn can be applied in configuration 
Ci to reach configuration C2 if ci = t[qfti . . . i„] and C2 = t[fqiti . . . qntn] for 
any context t and term word ii . . . t„. A run of the automaton is a sequence of 
applications of rules on a given input. A ground term t G T(F, X) is accepted or 
recognized if there is a run from configuration qot (where qo is an initial control 
state) to configuration t. The set of terms accepted by a tree automaton A is 
called the language of A and noted L{A). The languages accepted by finite tree 
automata are called recognizable. 

3 Rational Tree Relations 

Several authors have tried to define suitable notions of binary relations over 
terms generalizing known families of relations over words, like for instance the 



recognizable relations, or the more general rational relations (i.e. relations recog- 
nized by finite transducers). As of now, no extension to terms is really considered 
canonical, as each family of relations has its own merits and drawbacks. Several 
distinct families can be encountered: recognizable relations as such, relations 
defined as rational languages over some overlap coding of both projections of 
the relation, relations induced by various types of tree transducers, or the more 
specific class of ground tree transductions, to cite but a few (see |13j for a survey). 

In [14j . a notion of rationality for tuples of trees according to the union, 
substitution and iterated substitution operations is proposed. This notion can 
also be seen as a definition for binary rational relations over tuples of trees, and 
thus as a special case, binary relations over trees. Similarly to the word case, 
this class is strictly more general than the class of recognizable relations. In his 
paper, Raoult proves that the rational languages of tuples can be generated using 
a special kind of hyperedge replacement grammars. This definition is justified by 
its similarity to rational word relations on several aspects: first, as it should be, 
it coincides with rational word relations when restricted to trees of degree one. 
Second, it is closed under projection on any number of components, union and 
intersection. Finally, its mechanism is indeed quite close to the way a transducer 
works. However, this generality has a cost, and this class of relations is not closed 
under composition. 

First, we need to define the product operation we shall use to define rational 
sets, which is an extension of the usual substitution operation. Let t be a term 
word, X a word of n variables having k > instances xi . . . Xk in t (i.e. a total 
of ri * fc variables), and M a set of n-tuplcs of terms. We define t-^M as the set 
of tuples of terms obtained by replacing each instance of a; in t with a (possibly 
different) element of M. Formally: t -x M :— {t{si{j) ^ Xi{j) \ i G [l,fc], j G 
[l,n]}}. It is extended to sets in the usual way: L -x M :— {t -x M \ t € L}. 
Furthermore, define L"^ := L -x L""^^ and L*^ := Un>o We are now ready 
to define the notion of rationality associated to this product: 

Definition 3.1 ( |,14j ). The set Rat,,, of rational languages of n-tuples of trees is 
the smallest set of languages containing the finite languages of tuples and closed 
under the following operations: 

1. L e Ratn A M e Ratn LUM e Rat,, 

2. L e Ratn A a; e A Me Rat.„, L -x M e Ratn 

3. L e Ratn A a; e X" =^ L*- e Ratn 

The family Rat of rational languages over tuples of terms is the union of all 
Ratn, for n > 1. 

One should note that this notion of rationality difi^ers from the one defined 
in [TT], for example, as the concatenation (or 'series') product is not directly 
taken into account, and substitution is done simultaneously on several vari- 
ables. From this definition arises a straightforward notion of rational expres- 
sion, which extends the usual notion on words. It should be noted that Rati 
does not coincide with the set of recognizable term languages. For example, on 



E = {f'^^\g'-^\h''^\a'-°'>}, the language fg'^-ag^a £ Rati is defined by the ratio- 
nal expression /□in2[5ni.gn2]* [aa], but it is not a recognizable term language. 

Let us now recall the liyperedge replacement grammars used in |14| . which 
generate the rational languages of tuples of terms. In this paper, we will call 
grammar a hyperedge replacement grammar such that every production {A, a) 
has the following properties: 

— the terminal subgraph of a, say at, obtained by removing all non-terminal 
hyperedges from a, is an ordered forest with n connex components (a n-tuple 
of trees), where n is the arity of A, 

— the vertices of a belonging to a hyperedge are leaves of at, 

— no vertex of a belongs to more than one hyperedge. 

These properties allow us to refer to the right-hand sides of this type of grammars 
as 'leaf-linked forests'. The definition of grammar derivation is the usual one for 
hyperedge replacement. It will be useful to also recall the formal definition of a 
grammar from the point of view of terms, as it is done in the original paper: 

Definition 3.2 ([l4j). Given a set X of variables, a production is a pair (A, a) , 
where A £ X" (A = Ai . . . A„ is called a non-terminal a £ T{F,X x N)" 
(here X x N denotes the set of numbered instances of variables of X ), and both 
A and a are linear. A grammar is a finite set of productions such that the 
variables occurring in the right-hand sides can be grouped to form instances of 
non-terminals. A step of derivation of a grammar is defined as t — ^ 
aj I J £ [1,'^]} where t is a term word, there is a production {A, a) in G and A^ 
is an instance of A in t. The language generated by a grammar G from axiom 
A is the set of tuples of ground trees L{G,A) = {w £ T^F)^^^ \ A —^q w}. 

Example 3.3. Let A = A1A2 and B = B1B2BS be two non-terminals of respec- 
tive arity 2 and 3. The grammar Gi having rules 

A ^aa I gAi gA^ \ f A\A\ fA^Al \ fB^B^ B3 
B A\A\ fAlAl I gBi gB^ HB^ 

can be represented as a HR grammar in the following way: 




Then a possible production sequence of Gi would be: 
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As expected, these grammars generate the rational languages: 



Theorem 3.4 ( [14] ). A language of tuples of terms is rational if and only if it 
is the language generated by a grammar. 

A rational language of n-tuples of terms can also be seen as a binary relation 
in T{F)P X T{FY, where p+q = n. In this case, given a non-terminal A, we define 
the first and second projections t^i{A) and tt2{A) by the set of variables of A 
referring to the first (resp. second) projection of the relation. A similar notation 
is used for right-hand sides of grammar productions as well. For clarity, we write 
a production {A, a) as (A, 7ri(Q;) x 7r2(a)). Without loss of generality, we always 
consider that A = tti(A)'K2{A) and a = 7ri(Q!)7r2(a). For example, if the axiom 
of a T{F)P X r(i^)9 relation is A = Ai . . . yl„, we can have tti{A) = Ai . . . Ap 
and 7r2(A) = Ap+i . . . Ap+q=n- 

Example 3.5. Grammar Gi from Ex. 13.31 generates, from non-terminal A, a lan- 
guage L{Gi,A) G i?at2, which can be seen as a T{F) x T{F) relation. In this 
case, its rules can be written 

A ^ a X a\gAi X gAi \ fA\Al x fA\Al \ fB^B^ x B^ 
B ^ A\Al X fA\Al I gBi gB^ x hB^ 

4 Term Rewriting Systems 

A (term) rewrite rule is a pair (/,r) G T{F,X)'^ such that Var{r) C Var{l). A 
rewrite rule {l,r) is said to be linear if both / and r are. A rewrite system, or 
more specifically term rewriting system is a set of rewrite rules R. A system R is 
finite when \R\ is finite, and recognizable when the potentially infinite number of 
rules is given as a finite union of pairs U —^ V, where U and V are recognizable 
term languages. Note that we only consider systems where the total number of 
distinct variables is finite. A system is linear when all its rules are linear. We 
denote by Dom{R) (resp. Ran{R)) the set of left-hand sides (resp. right-hand 
sides) of i?, up to a renaming of the variables. The rewriting according to a 
system R is the relation 

— > {{c[lG],c[ra]) €T{F)xT{F) \ {l,r) e R A c e Ci{F) A aeS{F,X)}. 

R 

In case we want to specify that a rule {l,r) is used at some position p (resp. 
set of positions P), we use the notation — >p (resp. — > p ). The reflexive 

Lr l,r 

and transitive closure of -^^^ by composition is called the derivation of R and 
written 

Classification of Rewriting Systems. In the case of words, several natural classes 
of rewriting systems can be distinguished by the way their rules are allowed to 
overlap. In [5], the composition — >^ o of two rewritings is considered, 

and all the different possibilities of overlapping between the right-hand side of 



the first rewrite rule, and the left-hand side of the second one are examined. 
By discarding systems where unwanted overlappings occur, one obtains four 
general families of systems whose derivation is proven rational, the families of 
left, right, prefix and suffix word rewriting systems. Moreover, any system which 
does not belong to one of these families may have a non-rational derivation. As 
a consequence, as terms generalize words, we only need to study the extension 
of these four families of systems to terms: the classes of bottom-up and top-down 
systems, which respectively correspond to left and right systems, and the families 
of prefix and suffix systems. A term rewriting system R (resp. its inverse R~^) 
is said: 

— top-down (rcsp. bottom-up) if any overlapping between a right-hand side r 
and a left-hand side I of R (resp. R^^) is such that r = f[o] and I = oX for 
some (possibly trivial) 1-context f and substitution A, 

— prefix if any overlapping between a right-hand side r and a left-hand side I 
of R is such that I ~ rX or r = Ip for some possibly trivial substitutions A 
and p, 

— suffix if any overlapping between a right-hand side r and a left-hand side I 
of R is such that I ~ l[r] or r ~ r[l] for some possibly trivial 1-contexts f 
and I. 

The following picture illustrates these four kinds of overlappings: 




top-down bottom-up prefix suflix 



Prefix and sufhx systems respectively generalize root and ground rewriting sys- 
tems. Root rewriting systems are already known to be very powerful: indeed, 
they can simulate the execution steps of Turing machines. This implies a direct 
negative result concerning prefix systems. 

Proposition 4.1. Some linear prefix tree rewriting systems have a non rational 
derivation. 

Proof. Let M be a Turing machine with a set of states Q, a tape alphabet 
P and a set of transition rules T C {Q x P U {#} Q x P x {+, -}) (# 
denotes the 'blank' character). Let us build a prefix system on the alphabet 
Q U P U {#}, with variables in {x,y}, where Q is considered binary, P unary 
and # an overloaded symbol of arity either or 1. 

For all pA qB+ e T, Rm has a rule pxAy qBxy, plus a rule pxfj= gBx^ff 
if A = #. 

For all pA qB— e T and C E P, Rm has rules pCxAy — > qxCBy and 
p#Ay ^ plus rules p## ^ and pCx# ^ qxCB# if A = #. 

This system has both overlappings of the kind I = ra and of the kind r — la, for 



some left and right-hand sides I and r and substitution a. It is thus prefix, and 
neither top-down, bottom-up or sufhx in general It is quite clear that computing 
the derivation of Rm is equivalent to computing the reachability relation of 
M, thus is undecidable. Hence — >^ is non-recursive and can obviously not be 



However, contrary to the case of words, where prefix and suffix systems are 
dual and share the same properties, the situation is different in the case of terms. 
The family of ground rewriting systems, which is a sub-family of sufhx systems, 
has already been studied by several authors. In particular, Dauchct and Tison 
[8j showed that the derivations of ground systems can be recognized by a certain 
type of composite automata called ground tree transducers (GTT). Section [6] 
will use similar arguments in order to prove that, more generally, any suffix 
system has a rational derivation. The two remaining families of term rewriting 
systems we consider, namely top-down and bottom-up systems, are dual. The 
next section puts focus on top-down systems, but all the results extend to the 
bottom-up case (see Corollary [53]). 

5 Derivation of Bottom-Up and Top-Down Systems 

This section focuses on the study of top-down term rewriting systems and their 
derivations. For any finite linear top-down system, a grammar of tuples of terms 
generating its derivation relation can be built, which implies that this relation 
is rational. Furthermore, from the shape of the grammar, we observe that the 
derivation of such a system preserves the recognizability of term languages. Dual 
results can be obtained for bottom-up systems: the derivation of a linear bottom- 
up system is rational, and the inverse image of a recognizable term language is 
still recognizable. 

Let us first observe that top-down systems enjoy a kind of monotonicity 
feature. Any rewriting sequence of such systems is equivalent to a sequence 
where the successive rewriting steps occur at non-decreasing positions in the 
input term. We call this top-down rewriting. Let i? be a term rewriting system, 
we define its top-down rewriting by: 



such that the rewriting positions do not decrease along indexes (Vi,j, i < j 
~'{pj < Pi)), and if two successive positions arc equal then the second rewriting 
should not have a trivial left-hand side {{pi = Pi-i) {ui ^ X)). This last 
condition means that, for instance, the sequence 



rational. 



□ 




c[la] 



l,r 



c[ra] — > c[r'{a; i— > ra}] 



is not top-down, because the second rule produces its right-hand side 'higher' 
than the first one. The rewriting steps should be swapped to obtain the top-down 
sequence 

c[la] — > c[r' {x i-^ la}] — > pos(x,c\r']) c[r'{x i-> rcr}]. 

x.r' l.r 

The next lemma expresses the fact that, given any rewriting sequence of a top- 
down system, rewriting steps can always be ordered into an equivalent top-down 
sequence. 

Lemma 5.1. The relations of derivation and top-down derivation of any top- 
down term rewriting system R coincide: —>-*j^ = 

We are now ready to prove the rationality of the derivation of any top-down 
rewriting system. Using this property of top-down systems, it is possible to build 
a grammar which directly generates the derivation of any such system. This 
grammar mimics the way a rational word transducer works, using its control 
state to keep in memory a finite subterm already read or yet to produce. 

Theorem 5.2. Every finite linear top-down term rewriting system R has a ra- 
tional derivation. 

Proof. Let i? be a finite linear top-down system. We denote by O the set of all 
overlappings between left and right parts of rules of R: 

= {teCniF,X) I 3seCi{F,X), 

u e T{F,X)'',s[t] e Ran{R) A t[u] G Dom{R) }. (1) 

Remark that □ belongs to O. We will now build a grammar G whose language is 
exactly the derivation of R. Its finite set of non-terminals is {<*>} U Q, where 
Q = { <t> = <t>i ... <t>„+i \ t e O n Cn{F)} and, for all <i>G Q, 
772 (<i>) is a single variable. The production rules of G are of four types. 
Type (1): V / £ F^,, 

<□> ^ /<□>}...<□>!' X / <n>^ ... <n>^ 
<*> f <*>^ • ■ • <*>" 

Type (2): V i G O n C„(F), t[u] G O n C,n{F), 

<t> u[T:i{<t[u]>)] X Tr2{<t[u\>) 
Type (3): V t[u] G O, teOn Ce{F) (necessarily {ui, ...Ui}C O), 

<t[u]> 7ri{<Ul>) . . .TTl{<Ui>) X t[K2{<Ul>) . . .TT2{<Ui>)] 

Type (4): V {t[u], s[u]) G i?, v — vi . . .vg (necessarily {t, vi, ^ O), 



«> ua X .s[7r2(<wi>) . . . 7r2(<w^>)] 




Fig. 1. grammar associated to a top-down system. 

where cr is a variable renaming such that for any variable x of u, cr(x) = <Vi>j 
if X is the j-th variable to appear in Vi (from left to right), and u{x) — <*> if 
X does not appear in any of the Vi. Figure [T] illustrates the four types of rules. 

Intuitively, the role of this substitution is to gather into the same non- 
terminal or hyperedge all the variables of u belonging to the same Vi, while 
respecting the order in which these variables appear in Vi. This way, a correct 
instantiation of non-terminals of G is ensured. If a variable of u does not ap- 
pear at all in v, then it means that a whole input subtree is 'discarded' by the 
rewriting rule being applied. Thus the grammar should accept any subtree to be 
generated at this position, which is the role of the unary non-terminal <*>. 

For simplicity, we will only consider type (4) rules in which t^vi, . . .v„ are 
maximal. The other cases can be simulated by suitable finite compositions of 
rules of types (2), (3) and (4). □ 



Example 5.3. Consider the linear top-down system R over the alphabet F = 
g(i),/i(i),a(o)} with a unique rule fgxgy hfxy. The corresponding 
grammar is the grammar of Ex. 13.51 where each non-terminal stands for one of 
the possible overlappings of rules of R: A stands for □ and B for /□in2. Note 
that type (4) rules with non-maximal overlappings have been discarded. This 
example also illustrates the fact that the inverse image of a recognizable term 
language by the derivation of a linear top-down system is not recognizable in 
general: for instance, the image by Rq^ of h*faa is {h* fg'^ag'^a \ n > 0}, which 
is not recognizable. 

We will now mention a property of top-down systems, which has been known 
for the past few years for larger classes of systems. 

Proposition 5.4. The image of any recognizable term language by the deriva- 
tion relation of a finite linear top-down term rewriting system is recognizable. 

Top-down systems form a strict subfamily of generalized semi-monadic term 
rewriting systems [10] , which is itself a strict subfamily of right-linear finite path 



overlapping systems [15j . Both classes have been proven to preserve recogniz- 
abihty. As a consequence, this is also the case for top-down systems. However, 
it should be mentioned that neither of these classes has a rational derivation. 
Indeed, it is quite easy to find a generalized semi-monadic system whose deriva- 
tion cannot be recognized by any finite mechanism. For instance, the general- 
ized semi-monadic system whose unique rule is gx fgfx clearly has a non- 
rational derivation: its intersection with the rational relation ga x f*gf*a is 
ga X {f"gf"a\n > 0}. By the usual pumping arguments (adapted to this new 
setting), this relation is not rational. 

Finally, please note that the inverse of a top-down system is, by definition, 
bottom-up. For any top-down system wc can build a grammar G recognizing 
— Thus, the grammar 7r2(G)7ri(G') obtained by swapping both projections of 
G generates the derivation — >-Jj-i of the bottom-up system R~^. Inverse recog- 
nizability preservation follows. 

Corollary 5.5. Every finite linear bottom-up term rewriting system R has a 
rational derivation and the inverse image by of any recognizable term 

language is recognizable. 

6 Derivation of Suffix Systems 

This section presents a study of the derivation relations of suffix term rewriting 
systems. After introducing a property related to the notion of suffix rewriting, 
we show that the derivation of any recognizable linear suffix system is rational. 
Finally, we prove that the image or inverse image of any recognizable term 
language by the derivation of a recognizable linear suffix system is recognizable, 
and that it is possible to build a tree automaton accepting it. 

Definition 6.1. The suffix rewriting of a term rewriting system R is the relation 

= {{c[la],c[ra]) eT{F,Xf \ {l,r) e R A c e Ci(F) 

A a e S'(0, a:) bijective} 

(a bijective substitution in S{$,X) is a bijective variable renaming over X). 

Suffix systems have a specific behaviour with respect to sufl3.x rewriting. 
Indeed, the derivation of any input tree i by a sufRx system can always be 
decomposed in two phases. First, a prefix t of t is read, and several steps of 
suffix rewriting can be applied to it. Once this first sequence is over, t has been 
rewritten into a prefix s of s, never to be modified anymore. In a second time, 
the rest of t is derived in the same fashion, starting with suffix rewriting of a 
prefix of the remaining input. As a consequence, the derivation of a suffix system 
is equivalent to its 'iterated' suffix derivation. 



Lemma 6.2. For any suffix term rewriting system R, 



3 s,te T(F, X), a,T e S{F, X) s ^ sa A t = tr 
A s t A Vx G Var(s) Ci Var(t), a(x) t(x). 

B. R 

Another interesting property is that, for any recognizable system, a sufRx 
rewriting sequence is always equivalent to a sequence in two parts, where the 
first part only consumes suffix subterms of the input term, and the second part 
only produces new suffix subterms in their place. 

Lemma 6.3. For all recognizable linear term rewriting system R over F and 
X , there exist a finite ranked alphabet Q and three finite rewriting systems 

- i?_ C {px fpiXi . . .PnXn \ f ^ F, . . . ,p„ e Q, . . . ,X„ e X*} 

- R= Q {px ^ qy \ p,q € Q, x,y e X*} 

- -R+ C {fpiXi . ..p„X„ ^ px \ f e F, p,pi, ...,Pn & Q, X,Xi,...,Xn ^ X*} 

such that s — H t <f=^ s — H o — h t. 

R i?+uH= fl_ui?= 

Lemma [6.31 can be reformulated in the following way: a pair {s,t) of terms 
belongs to the suffix derivation of a system R if and only if there is a context c 
such that s ~ c[si . . . s„], i = c[ti . . . i„] and for all i € [1, n], there is a term QiXi 
such that Si-i\*j^^^j^^qiXi and qiXi-^*fi_uj^^ti. 

Theorem 6.4. Every recognizable linear suffix term rewriting system R has a 
rational derivation. 

Proof. Let i? be a recognizable linear suffix system on T{F, X). Let R= and 
R- be the rewriting systems mentioned in Lemma 16.31 Let iV be a set of pairs 
of the form u\v where u and v are two linear term words over Ran{R^ U R=)* 
and Dom{R- U R=)* respectively. Note that Ran and Dom are defined up to 
a renaming of the variables. We can thus impose that u and v share the same 
set of variables {Var(u) = Var{v)), and there is no pair of strict subwords u' 
and v' of u and v such that Var(u') ^ Var{v') (i.e one should not be able to 
split u\v in two correct non-terminals). This, together with the facts that F is 
finite and u and v are linear, implies that N is finite for some fixed, standard 
variable renaming. Thus, given an axiom J, we can build a grammar G whose set 
of non-terminals is U {/, /'}, having the following finite sets of productions: 

I fll-.-Ii X fll...I^ and /' — > //'^../'" (2) 

V e Dom{R_ U R=) n Ran{R+ U 

/ — > px\px (3) 




V u' u, V ^Jj^ v' , u' e Ran{R+)*, v' G Dom{R^)* , 

u\v — > u'\v' (4) 

V Ui =PlXi . ..PiX.i, U2 = Pj + lXj + i . . .PnXn, 

V = qiui . . . q^Vm , fpi+iXi+1 . . . pjXj R+ px, 

UipXU2\v > ^1 . . ./i; . . + i . . X Vi...l'ra (5) 

V M = pixi . . .p„Xn, vi = qiUi . . .qiUi, 

V2 = qj+iVj+i ■ ■ ■ qinVm, qyR- fqi+Wi+i ■ ■ ■ qjVj, 

u\viqyV2 > ^J.l...^J.n X . . . {fUi+i . . . Vj) Vj+i . . . U^n (6) 

In rules ([5]) and ([6]), all the {lJ-k)ke[i,n] a-nd {i^k)ke[i,m] a-re variables belonging 
to instances of non-terminals G N where u' and v' are built from terms 

iPkXk)k£li,n] and {qkyk)k£li,m] respectively. Variables /ii to /i„ (resp. vi to Vrn) 
appear only in the first (resp. second) projection of any non-terminal. Note that 
this instantiation is unique, by construction of the set N. It is also always possible 
since every rule of R is, by hypothesis, linear. 

Call p the substitution which maps each non-terminal variable {u\v)i to the 
term {u)i if i G [1, \u\] and to {v)i if i G [\u\ + 1, \u\ + \v\], and each non-terminal 
variable (/j)je[i,2] to a variable Xi. It is clear from the rules of Go that: 

/ s X t so — H o — H tp. (7) 

Go ii+Ui?,= R-UB.= 

We will not detail the proof of this observation. Notice that this grammar works 
in a very similar way to a ground tree transducer, which is the formalism used 
by [8] to recognize the derivation of a ground system. The only difference is that 
we keep track of the variables appearing in the left and right projections of the 
relation, so as to be able to resume the rewriting at relevant positions. Now add 
to Go the set of rules 

\f X £ X such that xRpx, qxRx, px\qx — > / (8) 

px\ ^ I' (9) 

and call this new grammar G. These last rules allow the derivation to go on 
properly after a first sequence of suffix rewritings has taken place, by creating 
new instances of the axiom between leaves where the same variable would appear. 
By Lemma [6 ■2[ G generates — □ 

Proposition 6.5. The image and inverse image of any recognizable term lan- 
guage by the derivation of a finite linear suffix term rewriting system is recog- 
nizable. 



Proof sketch. Once a grammar generating the derivation of a suffix system R 
is built, according to the previous proof, it is not difficult to synchronize the 
left projection of this grammar with any finite top-down tree automaton A. We 



thus obtain a new grammar, whose second projection yields a finite automaton 
accepting the image of L{A) by This is symmetrical, hence the converse. □ 
We will illustrate the fact that suffix systems are strictly more general than 
ground systems on the following simple example. 

Example 6.6. Consider the finite suffix system R = {fxy — > fyx, a — > go} 
over the ranked alphabet {Z'^-*, g'^-', a'"'}. The first rule of R allows to swap at 
any time both children of an /-node. This somehow expresses the commutativity 
of /. The derivation of R (restricted for the sake of clarity to {fg*ag*a)^) is the 
relation {{fg'^ag'^a, fg''ag'"'a)\m,n > 0} U {(/g™a5"a, /g™+ia5"a) | m, n > 
0} U {{fg"^ag"a, fg"'-ag"'^-^a) \ m,n > 0}, which is not recognizable by a ground 
tree transducer. 

Furthermore, we claim that the transition graph of this rewriting system 
is not isomorphic to the transition system of any (recognizable) ground term 
rewriting system as defined in |12I7| . Note: the transition graph of a rewriting 
system is the graph whose vertices are the terms from the domain or range of 
the system, and whose edges are all the pairs (s, t) such that s can be rewritten 
to t in one step. 

7 Conclusion 

This paper extends the left, right, prefix and suffix word rewriting systems de- 
fined in [5] to bottom-up, top-down, suffix and prefix term rewriting systems. The 
derivation relation of the three first types of systems can be generated by finite 
graph grammars, while systems of the fourth type have a non recursive derivation 
in general. We also stated some recogniz ability preservation properties of these 
classes of systems, and provided effective constructions in each case. Although 
|15j defines a class of recognizability-preserving rewriting systems strictly more 
general than top-down systems, they do not aim to provide a construction for 
the derivation relation itself, which is indeed not rational. As for suffix systems, 
to our knowledge, no comparable class of recognizability-preserving systems has 
been defined yet. 

This study puts in practical use the notion of rationality defined in |14| . 
which nicely extends the usual rational relations on words, even though some of 
their key properties are missing, like the closure by composition or systematic 
preservation of recognizability. However, this formalism is an interesting and 
powerful work basis for the study of binary relations on terms, especially thanks 
to the fact that it is general enough to extend asynchronous transducers (which 
is not the case of most other formalisms). Still, depending on one's objectives, 
it might be necessary to devise a more restricted notion of rational relations 
on terms, which would be closed under composition or preserve recognizability 
(or both) . Note that [M] contains the definition of such a subfamily of relations 
(called rational transductions) . However, it can be shown that the derivations of 
some top-down systems do not belong to this class. 

Finally, it could be interesting to look for extensions to some of the existing 
works previously mentioned. First, one may try to elaborate actual verification 



methods using our systems to model transitions, and recognizable term languages 
for sets of configurations, along the ideas of regular model-checking [5]. Indeed, 
being able to effectively build the transitive closure of the system's transition 
relation and compute the image of regular sets of configurations could lead to 
interesting results. Second, the definitions from |12| and [7] about transition 
graphs of ground systems, should extend smoothly to the case of suffix systems. 
Thus, it would be meaningful to determine whether part or all of their results 
extend to this new family, and in particular whether the transition graphs of 
suffix systems have a decidable first order theory with reachability Note that, 
as illustrated in Ex. 16. 6i we suspect that the transition graphs of suffix systems 
strictly include the former families of graphs. 
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A Appendix: Proof details 



We will present here the missing proof to Lemma [5A1 a construction which can 
be used as an alternative proof for Proposition 15.41 the proofs to Lemmas 16.21 
and 16.31 and a detailed construction for Proposition 16.51 

A.l Derivation of Bottom-up and Top-down Systems 

Lemma 15.11 The relations of derivation and top-down derivation of any top- 
down term rewriting system R coincide: 

Proof. By definition, C — It remains to prove the converse, by using the 
same technique as in [S]. As — »^ = ~*Jj_(xxX)' ni^Y assume that R n {X x 
X) = 0. Then, we can sort any derivation into a top-down derivation, as defined 
above, by applying an alternative version of the bubble sort algorithm in which 
removal and addition of elements are allowed. To do so, we use the following 
inclusions: 



U.V 


>p 


o 


x.v' 


p 


^ -^p 

X,'V 


o 


u.v 


2>p 






(10) 


u,v 


>p 


o 


u' .V' 


'. p 


u ,v 


o 


u,v 


2>P 


with It, u' 


ix 


(11) 


u.v 


>p 


o 


x.v' 


p 


^ ^p 

x,v 


o 


u.v 


2>p 


with u' , v' 


ix 


(12) 


u.v 


>p 


o 


x.v' 


p 


^ -^p 

x,v' 


o 


u.v 


2>p 


U >p o 

u.v 


x^v' 


(13) 



where — >>p (resp. ^>p) denotes rewriting at any position greater than (rcsp. 
greater or equal to) p, and ^2>p ^2>p denotes multi-step rewriting at any 
set of such positions. Let us now prove the first inclusion. Let 

r > p S ^ q t 
u,v x.v' 

with p ^ q. One can find 1-contcxts f and s and substitutions p and a such 
that r = f[up\, s = f[vp\ — s[xa] and t = s[v'a], with pos{n,f) — p and 
pos(n, s) = q. As p ^ q and by hypothesis on R, there is a substitution 7 such 
that f = s[x^] and a ~ "f[vp\. It is thus possible to swap the rewriting steps 
between r and t: 

r = {s[xj])[up] — >q {s[v'j])[up] >p {s[vj])[vp] = t, 

x.v' u,v 

where P C 2^9 is the set of positions at which the special variable □ occurs in 

s[v'^] {P ^ pos{n,s[v'^])). Hence 



and inclusion pop is proven. More generally, consider any two-steps rewriting 



T > p S > q t 

with p > q. By definition there must exist 1-contexts f and s and substitutions p 
and a such that r ~ r[up\, s — f [wp] = s[ucr] and t = s[w'(t], with pos(n, f) = {p} 
and pos(n,s) = {g}. The hypotheses we made on i? imply certain restrictions 
on the structure of configuration s. As R is descending, s[u'] ^ f. So there must 
be a substitution 7 such that f = s[w'7] and cr = 7[fp]. Hence 

r = [s{ui\)\up\ ~-^„ (s[v' -i])[up] — >p {s[v'-f])[vp\ = t, 

u ,v u.v 

where P = pos(n, s[u'7]). Thus 

r — >„ o — >p t. 

a:, I)' ^^,^) 

As P e 2^'?, inclusion ([TTjl is proven, li v' ^ X 01 v' = y & X /\ j{y) is 
not trivial, then P G 2^', so (|12p and the first part of ^3]) are true. Finally, if 
u € X, v' G X and 7(1'') is trivial, then r = fp, t = scr, f = s[it'], = cr, hence 

r = {s[u'])p — > q {s[v[u']])p — > p c 2>i is[v])p = tn 

x,v u' ,x' 

Proposition [574l The image of any recognizable term language by the derivation 
of a linear top-down term rewriting system is recognizable. 

Proof. First note that the domain of the derivation of a descending system is 
T(F), and its range is recognizable: the grammar constructed in the proof of 
Theorem 15.21 only has non-terminals whose second projection is reduced to a 
single variable. 

Let L be a recognizable term language accepted by some top-down tree au- 
tomaton A with a set of control states Q. Let i? be a linear descending system. 
By the previous construction, we are able to build a rational grammar G recog- 
nizing -^p. Let us now define a "product grammar" Ga whose domain is L and 
whose range is — >Jj (L). Ga's non-terminals will be of the form 

{Ni,qi)... (iV„, qn)iNn+i,qi . . . g„), (14) 

noted Nq^,,,q^ for short. For each production 

Ni... Nn+l > ti. ..tn S 

of G, the product grammar Ga will have all possible productions 

Nq^...q^ > . . . t„ S 

where the m-contextual term word ti . . . i„ is partially accepted by A (see Sect. 
[2|) with initial control word {qi . . . qn) and final control word {q[ ... 5^). Further- 
more, t[ . . . t'^ is obtained from ti . . . tn by pairing each of its m variables with 



the associated component of the final control word, and s' is obtained from s by 
pairing each of its variables with a word on Q* , so as to complete every instance 
of Ga^s non-terminals according to p^ . The result of this is that a pair {s,t) 
belongs to L{Ga) if and only if (s, t) G L{G) and s & L. In other words, the 
second projection of Ga is exactly the set of terms who are the image of some 
term in L by so 7r2(L(G^)) = (L). By forgetting the left projection 
of every grammar production, one gets a grammar where all non-terminals have 
an arity of 1 . Such grammars are called regular tree grammars and generate rec- 
ognizable term languages. In this case, we obtain a regular grammar generating 



A. 2 Derivation of Suffix Systems 

Lemma 16.21 For any suffix term rewriting system R, 



s — > t, k>Q 
R 



3 s,t e T{F,X), a,Te S{F,X), k' > 0, 



s = SCT A t ^ tr A s — H t 

R 

A Vx e Var{s) f) Var{t) (j{x) t{x). 



Proof. First note that, since — C — the inverse implication is trivial. It 
only remains to prove the direct implication. We will reason by induction on k: 

fc = 1: s —>-j^ t implies that there is a context c and a substitution a such that 
s ~ c[la] and t = c[r(7]. Thus by definition of suffix rewriting c[l] — H^c[r], 
and of course for all variable x, a-{x) — >^ a{x). 

k ^ k + 1: let s — s' — >„ t with IRr, s = sa and s' = s'a' . Two cases: 

•p G Pos{s'): if there exists a context c such that s' — c[l], then we have 
s — h|. s' —Mj^ c[r] and the condition is verified with k' = k and t = (c\r])a' . 
If not, then there must exist a context c and a non-trivial substitution 
u' such that s' = (c[Z])tj'. As R is sufiix and as, by induction hypothesis, 
s^^s', there must exist to such that s = (c[Z])a; and for all variable x 
common to / and r, a;(a;)— H Jjw'(a;). We can then write s as {c\l])uja, t as 
(c[r])ti;'CT', and verify the condition is true with k' = \. 

p ^ Pos{s'): by induction hypothesis, one can find fc' > such that for all x 

- ~ <k — k' 

common to s and s', cr(a;) — >^ (^'i^)- Furthermore, by applying rule 
(/, r) to one of the a-'{x), we get cr'{x) — ^ t{x). We thus have t = s't, 

s^^s' and a{x) — ^^ ^^ t{x) for all x in both s and s', which verifies 
the condition and concludes the proof. □ 

Lemma l6.3l For all recognizable linear term rewriting system R over F and X , 
there exist a finite ranked alphabet Q and three finite rewriting systems 

- R+ 'i= {PX ^ fpiXl . . -PnXn \ f e F, p,pi, ...,Pn £ Q, X,Xi,...,Xn G 

X*}, 



- R= C {px qy \ p,q e Q, x,y e X*}, 

- -R- C {fpiXi . . .PnXn px \ f e F, p,Pi,. 

X*} 



,Pn e 



such that 



R+uR= 



o — Hi. 

R-llR= 



Proof. Let i? be a recognizable linear rewriting system over F and X. Let 
(Aj, ;] be / pairs of finite top-down automata, each accepting a set of 

rules of R. The set of states of Ai (resp. Bi) will be referred to as Pi (resp. 
Qi). Without losing generality, we will suppose that all Pi and Qi are pairwise 
disjoint. We also define P as the union of all Pi and Q as the union of all Qi. 
For all state p S PUQ, define i'{p) as the set of all possible variable boundaries 
in the language accepted with initial state p. The way we defined recognizable 
linear systems, i.e. with a finite number of variables, we can consider without 
losing generality that = 1 for all p. 

In a first step, we define a new rewriting system R' on FUPUQ and X. The 
state alphabets P and Q are considered as ranked alphabets, where the arity of 
any of their symbols is equal to the number of variables appearing in the terms 
of its associated term language (which can be supposed unique without losing 
generality). For instance, suppose that from some state p, automaton A accepts 
the language g* fxy. Then, p will be considered as a binary symbol. 

We give R' the following set of rules. For all rule pf ^ pi . . .pm in some A, 
such that v{p) — v{pi) . . . v{pm) ~ xi . . . a;„, we have: 

fpiv{pi) . ..Prn'^iPm) ^ pXl . . . X„ £ R' . 

Rules of this kind allow us to consume a left-hand side of a rule in the input 
tree. For all rule qf —> qi . . . qm in some Bi such that ~ v{qi) . . . ^{qm) = 
Xi . . .Xn, we have: 

qXi... Xn fqiv{qi) ■ ■ ■ qrnV{qrn) S R' . 

Rules of this kind allow us to produce a right-hand side of a rule whose left-hand 
side has been previously consumed. Finally, for all pair (poj^o) of initial states 
of some pair (A^, Bi), with ;/(po) = xi . . . Xn and i'((7o) = x^-^ . . . Xk^ we have: 

POXl...Xn qoXki-.-Xk^ £ R! . 

This simulates the application of a rewrite rule from L{Ai) x L{Bi) by initiating 
a run of automaton Bi when a successful 'reverse run' of Ai has been achieved. 
When restricted to T{F)^, the derivation of R' coincides with -^^: 

ys,t eT{F), s t s^t. (15) 

R R' 



The proof of this property is not difficult and will thus not be detailed here. In 
the rest of the proof, p, q and all variations thereof designate automata control 



states in P U Q, variable words in X* are denoted by u, v, Ui, Vi, . . and ct is a 
variable renaming. 

In a second step, we define i?_ as {fpiUi . . .pnUn R' pu} ('consuming' rules), 
i?+ as {qv R' fqivi . . . QriVn} ('producing' rules) and R^ as the smallest binary 
relation in T{F,X)'^ closed by the following inference rules: 

(1) ^' (2) 



pu R- pu pu i?= qv 



pu R^ qv qu R= q'v q'v i?= q" z 



pua R= qva qu i?= q"z 

pu fpiui . . . pnUn fqivi . . . q^Vn R+ qv Vi, PiUj R= qjVj 

pu i?= qv 



(5) 



Since F, P, Q and X are finite, and each symbol p of P U Q has a definite arity, 
then P= is finite and effectively computable. Let us mention a simple property 
of R= : 

W pu,t,qv T{F, X), pu — H t — H qv =J> puR=qv. (16) 

This can be proved by induction on the nesting depth k of term t: 

k ~ 0: no rule of P_ or P-|_ is applied, thus pu^*j^_qv. Then, by inference 
rule (4), the property is true, 
fc => /c + 1: let us decompose the derivation sequence between s and t: 

pu H p'u' H fpiUi . . .pnUn 

R- 

— ^ o fqivi . . . qnVn H q'v' Hqv. 

R-UR= R+UR= R+ R= 

By induction hypothesis, we know that fpiui . . ■PnUn^fi_fqiVi . . . qnUm so 
by inference rule (5), p'u' P= q'v' . Finally, by inference rule (4), puR= qv. 

It remains to prove that R= and i?_ verify the lemma. By ([T5)) . it suffices 
to show that ^/j^u-R- ° ^/j-ui?- ^ ^*R'- Proving the direct inclusion is 
equivalent to proving i?= C — >i|j/, which can be seen easily by observing the 
inference rules given above. To prove the converse we will establish, by induction 
on k, the following property: 

Vfc, s A t =^ 3 c e Ci(P), (<z,uO,e[„] e T{F,X), 



S H c[qiUi . . . qnUn] H t. 

R+\JR= R-\JR= 



k — 0: s — t — c, n — 0, so trivially s ^|j^u/J- ~^R-UR- ^- 



fc =J> fc + 1: let s t ^t' with IR'r. By induction hypothesis, 
■s -^*R+uR= cfgiui . . .qnUn] ~M*R_uR^ t = c[ii . . .i„]. 

There are three possible cases: 

— 3c', t = c'[ii . . . t„ Z]: in this case s can be written c'[si . . . s„ Z], so the 
following derivation sequence is valid: 

S c[qiUi . . .QnUnl] C [qiUi . . . QnUn Qiu] 

— H c' [qiui . . . qnUn Qrv] {hccause I R' r) 

R= 

H c'[qiui . . .QnUnr] c'[ti . . .t,ir] = t' . 

R- R-UR= 

— 3i, ti = ti[l]: the only way to produce ti from qiUi is along the steps 

qiUi^R_uRji[qiu]^jl_ U. 
Thus the following derivation is valid: 

s c[qiui . . . qnUn] ^ c[qiui . . . ti[qiu] . . . g„u„] 

R+\JR= R-UR= 

— H c[qiui . . . ti[qrV] . . . qnUn] — H t'. 
R= R^ 

— 3c' , j > i > 1, t = c'[ti . . .tiltj+i . . An], with I = l[ti+i . . .tj]. The 
derivation sequence between s and t' can then be written 

S — ^ c[qiUi . . . qnUn] — ^ c[ti...tn] 

R+UR= R-UR= 

c[ti ...t, . . . q'ju'j tj + i ...tn] 

R-i- 

H c'[ti . . .UqiUtj + i . . .t„] H c'[ti . . .UqrVtj+i . . .tn] 

_R,+ /?,= 

^ c'[ti . . .tirtj+i . . .tn] = t' 

R— 

Notice that for aU k £ [i + 1, j], <?fc'«fe-^jj_uii=^fe~^is+ui?,= '2'fe"fc- Thus, by 
(fTB|) . qkUk R= q'^u'^., hence 

s ^ c[giwi . . . g„w„] c'[(7iui . . . . . . g„w„] 

— H c'[ti . . . r . . . t„] = t'. 



This concludes the proof of lemma 16.31 □ 

Proposition 16.51 The image and inverse image of any recognizable term lan- 
guage by the derivation of a recognizable linear suffix term rewriting system is 
recognizable. 



Proof. Let Rq be a recognizable linear sufRx term rewriting system on T{F), R 
an equivalent normalized system on T{F U F'), with F' a set of new function 
symbols. Let G be the grammar recognizing its derivation, built as in the proof 
to Theorem 16.41 N U {!,!'} its set of non-terminals. Let A be a finite non- 
deterministic top-down tree automaton accepting a recognizable language L. 
Suppose Qa is the set of control states of A, disjoint from F and X, qq G Qa 
its unique initial state. Let Q'^ be a disjoint copy of Q^, we define the following 
grammar Ga having non-terminals in (3^UQ^U((5^ xi?an(i?+Ui?'))* |-Dom(i?_U 
R')* and the following set of production rules: 

— For all rule rf fri . . .Tn'. 

r ^ /(ri)i...(r„)i X /(ri)2...K)2 (17) 
r' -^fr[...r'„ (18) 

— For aU r e Qa, px G Dom{R) n Ran{R): 

r — > {r,px)\px, (19) 

— For all rule piXi . . .p„a;„|w — > p'\x'\ ■ ■ -p'nX^-Xv' of type (U) in G and state 
word ri . . . r„ £ Q*^. 

(ri,pixi) . . . (r„,p„x„)|?; — > (ri,pia;i) . . . (r„,p',.T'J|u' (20) 

— For all rule u\pxu2\v — > si{f^i+i . . . ^j)s2 x t of type ([5]) of G and states 
word ri. ..nr rj+i . . . r„ e Q^: 

{ri,pi) . . .{rn,Pn)\v — >s'xt' (21) 



where rf — s-^ ffi+i ■ ■ -fj and s' and t' are obtained from s and t by replacing 
each occurrence of pk in a non-terminal variable by {rk,Pk)- 
— For all rule piXi . . .pnXn\v ^ sxt oi type ^ of G and word n . . .rn £ Qa- 

{n,pi)...{rn,Pn)\v s' xt' (22) 



where s' and t' are obtained from s and t by replacing each occurrence of pi 
in a non-terminal variable by {ri,pi). 
— Finally, for all xRpx, qxRx, x ^ X and r g Qa'- 

{r,px)\qx > r (23) 

ir,px)\ >r'. (24) 

One can see that, starting from non-terminal goj grammar Ga only accepts pairs 
(s, t) such that s — >Jj t and s £ L. Thus the set of all t such that (s, t) is generated 
by Ga from qo is exactly the image of L by the derivation of R: 



LiGA.qo) = -^{L). 



An automaton recognizing this set of terms can be built by taking the right 
projection of Gai and by treating each non-terminal variable as a unary non- 
terminal. The rules of this automaton are given by the rules of Ga broken into 
several rules over these non-terminals of length 1 . 

Note that this proof is totally symmetrical, and that the synchronization of 
G by a finite automaton A could be done on the second projection instead of 
the first. Thus the converse result. □ 



